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Abstract 


Background 


In the past decades, agricultural land abandonment and declining land-use intensity 
became common, especially in the Mediterranean countries of southern Europe. In some 
areas, this development opened up possibilities for rewilding and the recolonisation or 
expansion of large mammal populations. Yet, in some instances, co-occurrence of wild 
mammals and free-ranging domestic herbivores might lead to potential conflicts. It is, 
therefore, necessary to study the ecological interactions between wild and domestic 
mammal species to understand the effects of land abandonment and rewilding on 
biodiversity and ecosystem services. Camera traps are an effective tool for studying 
species interactions and occupancy dynamics as they allow for long-term monitoring with 
minimal interference. We conducted a long-term monitoring programme with camera traps 
in the Peneda-Gerés National Park in northern Portugal. The area has undergone 
substantial land-use changes following the abandonment of agricultural areas in the past 
60 years. While agro-pastoral activities, especially the breeding of free-ranging horses and 
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cattle, are still common in the area, the intensity of these activities has decreased 
significantly, promoting natural succession and an increase or return of several large 
mammal species in recent years. Overall, our project aims at: (1) assessing the population 
trends of the medium and large sized mammals in the area over time; (2) analysing the 
effects of passive rewilding on occurrence, abundance and behaviour; and (3) 
understanding potential interactions or conflicts between wild and domestic herbivores. In 
this publication, we present results of a primary occupancy analysis between 2015 and 
2020, as well as a comparison between occupancy and density estimates for 2019. 


New information 


Our publication provides a dataset from long-term camera-trap monitoring in the Peneda- 
Gerés National Park between 2015 and 2021. We established a 16 km? grid of 64 cameras 
deployed yearly during the summer months. Together with this publication, we publish the 
data and images collected between 2015 and 2021, using both the Camtrap DP standard 
and the GBIF Darwin Event Core. We obtained a total of 934,810 pictures on 41,234 trap 
nights. The pictures were automatically grouped into sequences with each sequence 
representing a distinct occurrence event, resulting in 80,191 occurrences. Out of those, 
14,442 contained observations of a species, while the remaining were either blank or the 
species was not identifiable. We only obtained the information whether a species was 
present or absent on a picture, disregarding the number of individuals. Most observations 
were of domestic cattle (Bos taurus) and horses (Equus caballus), followed by European 
roe deer (Capreolus capreolus) and wild boar (Sus scrofa). Further observations include 
red fox (Vulpes vulpes), gray wolf (Canis lupus), Eurasian badger (Meles meles), stone 
marten (Martes foina), common genet (Genetta genetta), Iberian ibex (Capra pyrenaica) 
and red deer (Cervus elaphus). We estimated occupancy and densities for the most 
common species. The project is on-going and additional data will be included in the future. 
The dataset is freely available for ecological analysis, but also for training machine-learning 
systems in automated image classification as all pictures have been manually classified. 
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Introduction 


Agricultural land abandonment and decreasing land-use intensity have been an 
increasingly important issue in Europe especially in Mediterranean countries, such as 
Spain (Gabarron-Galeote et al. 2015) and Portugal (Nunes et al. 2010). Between 1990 and 
2006, a total area of almost 120,000 ha was affected by abandonment processes in 
southern Europe (Hatna and Bakker 2011). This land abandonment is often associated 
with significant changes in biodiversity and ecosystem services. It can, for example, open 
up possibilites for reforestation or rewilding and the recolonisation or growth of large 
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mammal communities (Navarro and Pereira 2015). This might lead to potential conflicts 
with low-intensity agricultural practices, such as the breeding of free-ranging domestic 
herbivores. Understanding the long-term impacts of the de-intensification or abandonment 
of agricultural areas on wild mammal populations, as well as the potential interactions with 
domestic herbivore species in the area is, therefore, important for evaluating the effects of 
abandonment on biodiversity and ecosystem services. The Peneda-Gerés National Park in 
northern Portugal underwent such land-use changes following a rural exodus and 
abandonment of agricultural areas since the 1950s. This promoted a natural regeneration 
of oak forests, an increase in some local mammal populations and opened possibilities for 
passive rewilding of the area. For example, in 1997, the lberian ibex (Capra pyrenaica) 
was re-introduced in Galicia, Spain close to the Portuguese border and is, since then, 
recolonising the area (Fonseca et al. 2017). Yet, the breeding of free-ranging cattle and 
horses is still a common agricultural practice. This creates opportunities to study and 
monitor the effects of domesticated herbivores on the occurrence, density and behaviour of 
local wild mammal species and vice versa. 


Camera traps have proven to be an effective tool for this monitoring as they allow 
researchers to collect data while limiting interference with animals. They can provide 
information on distribution, behaviour, species richness or population dynamics (Rowcliffe 
and Carbone 2008; O’Connell et al. 2011; Burton et al. 2015; Caravaggi et al. 2017). 
Moreover, they improve the monitoring of elusive species that tend to avoid human 
observers and increase comparability and replicability as observations are stored as visual 
information. The information obtained can be used for a great spectrum of different 
ecological analyses ranging from occupancy modelling (MacKenzie et al. 2002) over the 
estimation of abundances of marked (e.g. capture-recapture, Karanth (1995)) and even 
unmarked animals (see Gilbert et al. (2020) for a review of recently developed methods). 


We developed a long-term wildlife monitoring programme in the Peneda-Gerés National 
Park using camera traps. The programme was initiated in 2015 and, since then, camera 
traps are deployed every year during the summer months (April - October). The aim is to 
analyse temporal trends throughout the study duration. For this study, we were interested 
in obtaining occupancy as well as density estimates for the species observed and 
comparing the two metrics. Occupancy is a frequently-used metric in monitoring 
programmes to understand how a species is distributed in space because it can be 
obtained from presence-absence data of even unmarked individuals. Yet, the density or 
abundance of species contains a higher informational value regarding the viability of a 
population and is directly comparable through time and space. We applied both occupancy 
modelling following MacKenzie et al. (2002), as well as camera trap distance sampling 
(CTDS) following Howe et al. (2017). 


General description 


Purpose: The long-term monitoring project aims to: (1) assess the population trends of the 
medium- and large-size mammals of the region over time; (2) analyse the effects of 
passive rewilding and other environmental variables on their occurrence and abundance; 
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(3) look at potential interactions between wild and domestic species and (4) analyse effects 
of environmental and anthropogenic variables on their behaviour (e.g. activity patterns). In 
this publication, we focus on analysing occupancy trends of the observed species between 
2015 and 2020, as well as comparing occupancy estimates with density estimates 
obtained in 2019. 


Project description 


Study area description: The study was conducted in the parishes of Castro Laboreiro and 
Lamas de Mouro in the Peneda-Gerés National Park in northern Portugal (Fig. 1). The 
Park was created in 1971 with the aim to protect the high natural value landscape. It is the 
oldest protected area and the only national park in Portugal. Over the past 60 years, the 
region has undergone substantial land-use changes (van der Zanden et al. 2018). These 
changes were especially marked in the area surrounding Castro Laboreiro, where 
historically, agro-pastoral activities were common, with seasonal migration from summer 
villages at the plateau to winter villages in the valley (van der Zanden et al. 2018) and 
socio-economic changes had led to a large population decline (Rodrigues 2010). This 
abandonment opened possibilities for natural succession and passive rewilding in the area. 
Today, the area supports a diversity of habitats consisting of small agricultural fields in the 
valley, shrublands and oak forest patches in the hillside and pastures for cattle and 
agricultural fields in the plateau (Rodrigues 2010). Since 1971, Castro Laboreiro is part of 
the Peneda-Gerés National Park and since 1997, it is also part of the European protected 
area network Natura 2000. 


A Camera traps 


Land use types 
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Figure 1. EES] 


A) Location of the Peneda-Gerés National Park. B) Location of the Castro Laboreiro and 
Lamas de Mouro parishes. C) Camera-trap locations and land-use types within the survey 
area. Cameras were placed randomly with regard to animal density and activity, but the 
locations were chosen in a way to represent the different land-use types in the area relative to 
their overall occurrence. 


The elevation in the area ranges from 300 to 1,340 m (van der Zanden et al. 2018). As it is 
located at the transition between the Mediterranean and Atlantic biogeographic zones, the 
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region has a temperate Mediterranean climate, characterised by cold and rainy winters and 
warm summers (van der Zanden et al. 2018). The average annual temperature throughout 
the study duration was 11.9°C at 800 m elevation, with an average annual precipitation of 
1858 mm between 1985 and 2015 (SNIRH - Sistema Nacional de Informagao de Recursos 
Hidricos 2022). Within the camera-trap grid, we identified seven different land-use types 
from aerial imagery in 2020, with bare rock accounting for almost half of the area (45.30%) 
followed by low shrub (20.13%) and oak forest (18.77%). While agriculture (2.42%) and 
urban infrastructure (1.11%) only represent a considerably small part of the area, high 
shrub and pine forest make up for 8.64% and 3.63%, respectively (Fig. 1). 


Sampling methods 


Description: The dataset is obtained from a long-term monitoring campaign that has been 
conducted every year since 2015. Currently, images from 2015 to 2021 are classified, but 
further data will be included in the future. In 2015 and 2016, camera traps were deployed 
from April to August and from 2017 to 2019 from May to October (see Table 1 for details). 
In 2020, because of travel restrictions during the Covid-19 pandemic, camera traps could 
only be deployed in June and were left in the field until May 2021. Due to theft and 
malfunction, the number of operative cameras ranged from 61 in 2016 to 48 in 2020/21. 
Trap-nights per year ranged from 4,236 in 2015 to 9,606 in 2020/21. 


Table 1. 


Summary statistics of all years. *observations of humans are excluded. 


Year No. Deployed Ended Trap- Pictures Sequences Sequences 
Cameras days total* observation 

2015 58 19.04.2015 19.08.2015 4,236 295,562 16,928 2,850 

2016 61 13.04.2016 27.08.2016 6,744 280,239 18,129 3,761 

2017 54 08.05.2017 03.10.2017 7,169 52,542 4,608 1,715 

2018 58 15.05.2018 15.10.2018 6,649 31,437 = 11,238 1,283 

2019 57 07.05.2019 08.10.2019 6,830 175,443 12,364 2,902 

2020/21 48 02.06.2020 07.05.2021 9,606 99,587 16,924 1,931 

All 41,234 934,810 80,191 14,442 


Sampling description: For the study, 64 camera traps (Reconyx Hyperfire HC600, 
Holmen, WI, USA) were deployed in a 16 km? grid southwest of Castro Laboreiro. They 
were distributed as uniformly as possible across the different land-use types (e.g. 10% of 
cameras in land-use types that cover 10% of the area) with one camera per 0.25 ha grid 
cell (approx. 500 m spacing between each camera, Fig. 1). Real locations could deviate by 
up to 100 metres from the planned locations due to accessibility and placement 
possibilities. Further, some locations had to be adjusted throughout the years because of 
changes in the vegetation structure or due to theft, but new locations were within 100 m of 
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the original location and placed in similar habitats. The coordinates in the dataset were 
rounded to three decimals to protect the camera traps from theft. 


The cameras remained active for 24 hours per day and were programmed on motion 
sensor to take three consecutive pictures each time they were triggered by an animal with 
no delay after a trigger event. The sensitivity of the sensor was set to high in 2015, 2016 
and 2019, medium in 2017 and 2018 and medium/high in 2020 (see eventRemarks). 
Sampling effort was measured as the number of camera traps multiplied by the number of 
days they remained active (Rovero et al. 2010). 


Quality control: To ensure using the updated scientific name and common name of 
species, the taxonomic nomenclature followed the Catalogue of Life (https:// 
www.catalogueoflife.org). Additionally, we checked every species in the database of the 
IUCN Red List of Threatened species (https:/Avww.iucnredlist.org) for their conservation 
status and populations trends. 


Step description: Each image obtained from the camera traps was classified manually. 
The images were later imported into Agouti (Casaer et al. 2019, https:/Avww.agouti.eu) to 
be made publicly available. There they were automatically grouped into sequences of 
contiguous images with each sequence representing one distinct occurrence event. The 
pre-classified observations were linked to the respective sequences using the image name. 
Starting with the data from 2020, the images were directly imported into Agouti and 
classified within the software. The dataset was exported from Agouti in Camtrap DP format 
(Camtrap DP Development Team 2021) and manually converted to Darwin Core standard 
(Darwin Core Task Group 2009). It is structured as a sample event dataset including the 
event and the occurrence data and published as a Darwin Core Archive (DwC-A). Here, an 
event refers to a camera-trap deployment at a certain location over a certain amount of 
time (equivalent to deployment in Camtrap DP format) and an occurrence refers to a 
distinct occurrence event (here: sequence, equivalent to observation in Camtrap DP 
format). The published DwC-A dataset only includes occurrences that contained 
observations of an animal. As the DwC-A format currently does not allow for hierarchical 
datasets with more than two levels, we included the first ten images of each occurrence 
event as associatedMedia in the occurrence table. The original Camtrap DP dataset 
including also blank and unknown occurrences will be published as supplemental material 
to this publication (Suppl. material 4, Suppl. material 5) and made available through GBIF 
in the future. The media file containing information and identifiers for all images will be 
published once the Camtrap DP integration in GBIF is possible and is, until then, available 
upon request. 


Geographic coverage 


Description: Castro Laboreiro and Lamas de Mouro, Peneda-Gerés National Park, 
Portugal. 


Coordinates: 41.997 and 42.036 Latitude; -8.204 and -8.158 Longitude. 
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Taxonomic coverage 
Description: Mammals and birds were identified to the species level where possible. 


Taxa included: 


Rank Scientific Name Common Name 
class Mammalia Mammals 
class Aves Birds 


Temporal coverage 


Data range: 2015-4-19 - 2015-8-19; 2016-4-13 - 2016-8-27; 2017-5-08 - 2017-10-03; 


2018-5-15 - 2018-10-15; 2019-5-07 - 2019-10-08; 2020-6-02 - 2021-5-07. 


Usage licence 


Usage licence: Creative Commons Public Domain Waiver (CC-Zero) 


Data resources 


Data package title: Long-term monitoring of mammal communities in the Peneda-Gerés 


National Park using camera-trap data 


Number of data sets: 2 


Data set name: CT_Peneda_events 


Download URL: https://doi.org/10.15468/rah33) 


Data format: Darwin Core Archive format 
Data format version: Version 1 


Description: The dataset is published in the Global Biodiversity Information Facility 
platform, GBIF (Zuleger et al. 2023). It includes all observations of a species where 
classification was possible. Observations of humans are excluded from the dataset. It 
is structured as a sample event dataset that has been published as a Darwin Core 
Archive (DwC-A), which is a standardised format for sharing biodiversity data as a set 
of one or more data tables. This data file contains the 331 camera-trap deployments 
during a certain year (eventID) from which camera-trap images were obtained. The 
GBIF IPT (Integrated Publishing Toolkit, Version 2.5.6-rd6f172f) archives the data and 
thus serves as the data repository. The data and resource metadata are available for 
download from GBIF. Additionally, the original Camtrap DP dataset including 
deployment and observation data is made available as supplemental data to this 
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publication. The media data will be made available through GBIF once the integration 
of the Camtrap DP format is possible and is, until then, available upon request. 


Column label 
eventID 
locationID 
decimalLatitude 
decimalLongitude 


geodeticDatum 


coordinateUncertaintyInMetres 


verbatimCoordinates 


verbatimCoordinateSystem 


verbatinSRS 


eventDate 


higherGeography 


continent 
country 
countryCode 


stateProvince 


county 


municipality 


locality 


verbatimElevation 


Column description 

Unique identifier for the set of information associated with the event. 

Identifier for the location information, here: Camera-trap location in a certain year. 
Geographic latitude (in decimal degrees). 

Geographic longitude (in decimal degrees). 


Geodetic datum or spatial reference system (SRS) upon which the geographic 
coordinates given in decimalLatitude and decimalLongitude are based, here: 
WGS84. 


The horizontal distance (in metres) from the given decimalLatitude and 
decimalLongitude describing the smallest circle containing the whole of the 


Location, here: 100 m as coordinates were rounded to three decimals. 
Verbatim original spatial coordinates of the Location. 


Coordinate format for the verbatimCoordinates of the Location, here: decimal 


degrees. 


The ellipsoid, geodetic datum or spatial reference system (SRS) upon which 
coordinates given in verbatimLatitude and verbatimLongitude, or 


verbatimCoordinates are based, here: WGS84. 


Interval during which an Event occurred, here: time camera was deployed and 


functional. 


List of geographic names less specific than the information captured in the locality 


term. 

Name of the continent in which the Location occurs, here: Europe. 
Name of the country in which the Location occurs, here: Portugal. 
Standard code for the country in which the Location occurs, here: PT. 


Name of the next smaller administrative region than country (state, province, 


canton, department, region etc.) in which the Location occurs, here: Norte. 


Full, unabbreviated name of the next smaller administrative region than province 
(county, shire, department etc.) in which the Location occurs, here: Viana do 


Castelo. 


Full, unabbreviated name of the next smaller administrative region than county 


(city, municipality etc.) in which the Location occurs. 
Specific description of the place. 


Elevation (above sea level) of the Location in metres. 
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ownerInstitutionCode Name of the institution having ownership of the information referred to in the 


record, here: iDiv. 


samplingProtocol Protocol used during an Event, here: camera traps. 
samplingEffort The amount of effort expended during an Event. 
sampleSizeValue Numeric value for a measurement of the size of a sample in a sampling event, 


here: number of trap nights. 
samplieSizeUnit Unit of measurement of the size of a sample in a sampling event, here: trap-nights. 


eventRemarks Additonal remarks regarding the setting of the camera traps, here: Trigger 


Sensitivity setting of the camera trap. 


Data set name: CT_Peneda_occurrences 
Data format: Darwin Core Archive format 
Data format version: Version 1 


Description: The dataset is published in the Global Biodiversity Information Facility 
platform, GBIF (Zuleger et al. 2023). It includes all observations of a species where 
classification was possible. Observations of humans are excluded from the dataset. It 
is structured as a sample event dataset that has been published as a Darwin Core 
Archive (DwC-A), which is a standardised format for sharing biodiversity data as a set 
of one or more data tables. This data file contains the 14,509 sequences 
(occurrencelD) on which a species was observed (occurrenceStatus = present). It is 
linked to the camera trap deployments through the eventID. The dataset also includes 
the information on the respective species as well as the first ten images associated 
with this occurrence. The data and resource metadata are available for download from 
GBIF. Additionally, the original Camtrap DP dataset including deployment and 
observation data (including also blank sequences) is made available as supplemental 
data to this publication. The media data will be made available through GBIF once the 
integration of the Camtrap DP format is possible and is, until then, available upon 
request. 


Column label Column description 
eventID Unique identifier for the set of information associated with the event. 
occurrencelD Unique identifier for an occurrence, here: sequence of images referring to a distinct 


occurrence event. 


basisOfRecord The specific nature of the data record, here: MachineObservation. 
occurrenceStatus A statement about the presence or absence of a Taxon at a Location, here only presences 
are included. 


eventDate Date when the event was recorded. 


10 


year 
month 

day 

eventTime 
samplingProtocol 
occurrenceRemarks 


taxonID 


identificationRemarks 


identifiedBy 
organismName 
scientificName 


higherClassification 


kingdom 
phylum 

class 

order 

family 

genus 
specificEpithet 


infraspecificEpithet 


taxonRank 


associatedMedia 
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The four-digit year in which the Event occurred. 

The integer month in which the Event occurred. 

The integer day of the month on which the Event occurred. 

Time at which the event occurred. 

Description of the methods or protocols used during an event, here: camera trap. 
Comments or notes about the Occurrence, here: captureMethod: motion detection. 
Identifier for the set of taxon information (data associated with the Taxon class). 
Comments or notes about the Identification, here: classificationMethod: human or machine. 
Person, group or organisation who assigned the Taxon to the subject. 

Textual name or label assigned to an Organism instance. 

Full scientific name in lowest level taxonomic rank that can be determined. 


A list (concatenated and separated) of taxa names terminating at the rank immediately 


superior to the taxon referenced in the taxon record. 

Full scientific name of the kingdom in which the taxon is classified. 

Full scientific name of the phylum or division in which the taxon is classified. 
Full scientific name of the class in which the taxon is classified. 

Full scientific name of the order in which the taxon is classified. 

Full scientific name of the family in which the taxon is classified. 

Full scientific name of the genus in which the taxon is classified. 

Name of the first or species epithet of the scientificName. 


Name of the lowest or terminal infraspecific epithet of the scientificName, excluding any 


rank designation. 
Taxonomic rank of the most specific name in the scientificName. 


A list (concatenated and separated) of identifiers (URL) of media associated with the 
occurrence, here: the first 10 images of an occurrence (if more than 10 images were 
obtained). All images can be obtained from the Camtrap DP media file, which will be 
published through GBIF once the integration of the Camtrap DP format is available and is, 


until then, available upon request. 


Additional information 


Analysis 


The purpose of this study was to obtain an estimate of naive occupancy and abundance 
while accounting for imperfect detection and without relying on individual identification of 
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the animals. For occupancy, we followed the modelling approach of MacKenzie et al. 
(2002). It assumes that, if a species is detected at a location, it can be considered present. 
Yet, non-detection of a species does not equal absence as the species might have gone 
undetected by the observer. The model, therefore, uses repeat sampling and the obtained 
detections and non-detections at each site on each survey to estimate the expected 
occupancy probability a) and the probability of detecting the species given that it is 
present (P). In doing so, it allows the estimation of the probability of site occupancy when 
the detection probabilities are <1. 


We fitted a single-season occupancy model for each year separately using a Maximum 
Likelihood framework and calculated the Maximum Likelihood for @ and P following 
MacKenzie et al. (2002) by: 


L(y, = |" Tor a-po"™) «fuTra pr) + (1— v) 


t=1 
Eq. 1 


with time (t), the total number of surveyed sites (N ), the number of distinct sampling 
occasions (T), the number of sites where the species was detected at time t (Te) and the 
total number of sites at which the species was detected at least once (Td -). The detection 
matrices of presence/absence records used for analysis consisted of one record per 
species, camera location and day. We used the R package unmarked (Fiske and Chandler 
2011) to estimate occupancy and detection probabilities, as well as standard errors (SE). R 
code for the occupancy analysis is available in Suppl. material 6. 


For the estimation of population densities, we applied the camera-trap distance sampling 
(CTDS) methodology developed by Howe et al. (2017) to the data from 2019. CTDS 
applies the distance sampling framework by Buckland et al. (2004) without the need for 
individual identification and allows us to account for imperfect detection. Here, a camera 
trap which is deployed at point k for a period of time Ti, serves as the observer. Density 
can be estimated as: 


K 
Bf. ys Nk 
rn k=1 
D= z 
Ow? ws Ti, Py 
k=1 
Eq. 2 


where Qis the horizontal angle of view of the camera model, ‘W is the truncation distance 
beyond which observations are discarded, Tk is the number of animals recorded at point 
ke and Py is the probability at each snapshot moment that an animal within the survey area 
is detected between 0 and ‘UU in front of the camera (Howe et al. 2017). 


We measured the distance to the camera of each animal present on a picture by visually 
comparing them to reference distances recorded during the deployment in that year. A 
detection function was fitted to the observation distances in the software Distance 7.3 
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(Thomas et al. 2010) using a conventional distance sampling approach with a point 
transect model. Two different key functions (Hazard-rate and Half-normal) with up to two 
cosine adjustment terms were applied and non-parametric bootstrapping with 999 
iterations generated by sampling with replacement from the obtained data being used to 
calculate the coefficient of variation (Buckland et al. 2004). We estimated P as the mean 
detection probability of all models weighted by their AIC (Akaike Information Criterion; 
Akaike (1973)). The density estimates were corrected for the limited availability of species 
(Howe et al. 2017) by estimating an availability factor CL which was derived by calculating 
the proportion of time animals spend active (activity level), following Rowcliffe et al. (2014). 


Additionally, we calculated the mean encounter rate Ee (number of observations per 
camera) across all cameras and the Coefficient of Variation (CV) of the density estimates 
following the delta method (Dorfman 1938): 


cv (B) =CV(E)?+CV (6) + CV (a) 
Eq. 3 


The CV of the encounter rate was obtained following the approach of Fewster et al. (2008) 
for estimating the encounter rate variance in distance sampling. The CV of the detection 
probability was calculated as the CV of all models weighted by their AIC. The CV of the 
activity level was calculated using bootstrapping with 999 iterations resampling from the 
Original data (Rowcliffe et al. 2014). 


Results 


We collected a total of 934,810 pictures on 41,234 trap days with the majority of the 
pictures being collected in 2015 and 2016. In 2017 and 2018, we obtained considerably 
less images and observations probably due to the lower sensitivity setting of the camera 
traps (Table 1). Grouping the pictures into sequences led to a total of 80,191 distinct 
occurrence events with a mean of 13.2 pictures per event. Out of those, 14,442 contained 
observations of an animal species. In total, we observed ten wild mammal species and six 
domestic species (Table 2). The most frequently observed species were domestic cattle 
(4,572) and domestic horses (4,554), followed by European roe deer (2,548) and wild boar 
(1,891). 


Table 2. 


Number of sequences (observations) of each species per year and in total. 


Species 2015 2016 2017 2018 2019 2020-2021 Total 
Wild 

European roe deer Capreolus capreolus 463 520 150 156 616 643 2,548 
Wild boar Sus scrofa 267 286 138 169 677 354 1,891 


Red fox Vulpes vulpes 92 100 1 5 106 72 376 
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Species 2015 2016 2017 1.2018 2019 2020-2021 Total 
Gray wolf Canis lupus 14 40 5 19 37 5 120 
Stone marten Martes foina 4 5 0 1 6 0 16 
Iberian ibex Capra pyrenaica 0 3 0 2 9 38 52 
Common genet Genetta genetta 0 2 0 0 5 3 10 
Red deer Cervus elaphus 0 0 0 5 4 0 9 
Eurasian badger Meles meles 1 0 0 0 3 5 9 
European rabbit Oryctolagus cuniculus 2 6 0 0 0 0 8 
Domestic 
Domestic cattle Bos taurus 1,297 1,369 327 460 764 355 4,572 
Domestic horse Equus caballus 660 1,320 1,088 452 617 417 4,554 
Domestic dog Canis lupus familiaris 38 68 1 2 14 21 144 
Domestic sheep Ovis aries 3 18 4 1 0 0 26 
Domestic goat Capra hircus 1 5 1 7 9 0 23 
Domestic cat Felis catus 0 0 0 1 1 4 6 
Other 
Birds Aves 8 13 0 3 8 11 43 
Rodents Rodentia 0 5 0 0 25 3 33 
Lizards Reptilia 0 1 0 0 1 0 2 


Detection probabilities obtained from occupancy modelling were generally low across 
species and approached zero for species with fewest observations (Fig. 2). The highest 
occupancy estimates throughout the study duration were obtained for European roe deer 
(0.72 - 1) and wild boar (0.48 - 0.92) (Fig. 2), yet their densities were considerably low 
(Table 3). In contrast to that, domestic cattle and horses showed very high density 
estimates, while their occupancy was rather low (0.47 - 0.72 and 0.46 - 0.73, respectively). 
It seems that the domestic species are slightly decreasing in occupancy over the years, 
while the wild species remain mostly constant. For all other species, we obtained 
comparatively low occupancy estimates over the years, but for many of them, the number 
of observations is too low to make any assumptions about temporal trends at this point 
(Table 2). In 2017, 2018 and 2020, most of the species showed lower occupancy estimates 
than in other years. A Spearman rank correlation analysis revealed that, for the more 
common species (e.g. cattle, horses, roe deer and wild boar), detection probabilities were 
positively correlated with the sensitivity setting of the cameras in that year (R = 0.5, p = 
0.002). Yet, we found no significant correlation between occupancy and sensitivity (R = 
0.11, p = 0.55). All occupancy and detection probabilities are presented in Suppl. 
material 1. 
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When looking at the density analysis, the obtained distances (Suppl. material 2) showed 
that many cameras only triggered up to 10 metres, even though the motion sensor range 
was reported to be up to 30 metres (manufacturer's information). For all species, 
observation distances decreased with increasing distance. Yet, for foxes and wild boar, we 
obtained very few observations close to the camera, indicating that some animals might 
have gone undetected at short distances. For both red fox and gray wolf, the density 
estimates were low with confidence intervals overlapping zero. Therefore, the results 
should be observed with care. Information and results of each model contributing to the 
final estimates are presented in Suppl. material 3. 


Comparing the occupancy and the density estimates from 2019, we did not observe a 
strong relationship between them when including all six species in the analysis (R = 0.54, p 
= 0.30). Yet when only including the wild species, there is a very strong and marginally 
significant positive correlation between occupancy and density (R = 1, p = 0.08, Fig. 3). 
With higher densities, the occupancy probability and, therefore, the amount of space used 
by the population increases. 
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A) Detection (p) and B) occupancy () probabilities for every species per year (see Suppl. 
material 1 for all values). Transparent bars mark models with too few observations to obtain 
reasonable results. 
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Table 3. 

Density estimates of the six most common species in 2019. n = sample size (total number of first 
trigger images), Nmodel = Number of first trigger images included in the distance sampling model 
after truncation, ‘Z = mean encounter rate, a = activity level / availability factor, p = detection 
probability, D = density (individuals per km?), CV = coefficient of variation. 


Species n Nmodet € [CV] a [CV] p [CV] D [CV] 
Gray wolf Canis lupus 88 77 0.22 [34%] 0.35[9%] 0.24 [44%] 0.15 [56%] 
Red fox Vulpes vulpes 146 114 0.16 [19%] 0.53 [11%] 0.16 [23%] 0.30 [32%] 


European roe deer Capreoluscapreolus 1,109 672 0.83 [16%] 0.44[4%] 0.06[8%] 5.88 [18%] 


Wild boar Sus scrofa 2,057 1,754 1.01[21%] 0.44[3%]  0.25[5%] 2.59 [22%] 
Domestic horse | Equus caballus 6,801 2,610 6.19[25%] 0.55[2%]  0.11[3%] 10.89 [25%] 
Domestic cattle Bos taurus 6,321 1,046 5.19[28%] 0.36[2%] 0.13[7%] 23.39 [29%] 
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Comparison between occupancy estimates obtained from occupancy modelling (MacKenzie et 
al. 2002) and density estimates obtained through camera-trap distance sampling (Howe et al. 
2017) for the six most common species in the study area in 2019. 


Discussion 


We are presenting an extensive dataset obtained from long-term camera trapping that will 
be maintained and extended in the future. The dataset will offer the opportunity to study 
local changes in occupancy and density, as well as responses to changes in land-use and 
vegetation structure or anthropogenic influences on a diverse mammal community. It is 
currently published in the Darwin Core Archive (DwC-A) format, but GBIF is in the process 


16 Zuleger A et al 


of implementing a new data model allowing for improved presentation of camera-trap data, 
as well as the upload of Camtrap DP data in the Integrated Publishing Toolkit (IPT). As the 
DwC-A standard is not very well suited for camera trap data, for example, by not allowing 
the hierarchical structure of deployments, media and observations and the manual 
conversion from Camtrap DP to DwC-A has proven to be challenging (e.g. it was only 
possible to include images as associatedMedia in the occurrences and not as a distinct 
source data), we will adopt the new data model once available. Our dataset can then be 
used for the training for automated computer classification systems, as all images will be 
made available online. 


The very simple index of occupancy presented in this paper already shows that there might 
be certain trends, such as a decrease in the space used by domestic species and a slightly 
increasing occupancy of species, such as the red deer or the Iberian ibex that are currently 
in the process of repopulating the area (Moco et al. 2006; Santos et al. 2011). The data 
from following years will hopefully render further insights into those processes, especially if 
also environmental variables or multiple species interactions are included in the analysis. 
However, we observed an effect of the sensitivity setting of the camera traps on the 
detection probabilities which needs to be explored in more depth in future analysis by 
incorporating it directly in modelling. Additionally, it needs to be considered that, because of 
the design of our camera trapping grid (cameras being only 500 m apart), the assumption 
of independence amongst sampling locations (MacKenzie et al. 2002) is very likely to be 
violated, so WV should be interpreted with care. 


Regarding the density estimates obtained from CTDS, our results were consistent with 
literature from the same or similar study areas. Unfortunately, for many species, there is no 
recent literature on population densities. For example, for the gray wolf the last national 
assessment of populations in Portugal was performed in 2002 - 2003 by Pimenta et al. 
(2005) using a combination of different indirect sampling methods and resulting in a much 
lower density estimate of 0.03 — 0.06 individuals/km? (compared to 0.15 individuals/km? in 
our study) for the study area. Yet, several studies have recorded an increase in wolf 
numbers and positive population growth rates in Portugal since 2007 (Campos 2018). For 
roe deer, a relatively recent study by Valente et al. (2014) estimated densities using the 
distance-sampling framework with data from pellet-group counts in the Montesinho Natural 
Park (approximately 100 km east of the survey area) and calculated a density of 3.5 (2.26 
— 5.45) individuals per km? (5.88 individuals/km? in our study). Many roe deer populations 
have increased greatly in Portugal within the last 20 years (Torres et al. 2015), so an 
increase in the Peneda-Gerés National Park due to changes in land-use practices and re- 
naturalisation of habitats is likely. To date, there are no studies estimating wild boar 
abundances or densities in northern Portugal. Bosch et al. (2012) and Pittiglio et al. (2018) 
have conducted surveys using habitat suitability models to predict wild boar densities in 
northern Portugal. Yet, their estimates were close to zero. This shows that modelled 
densities from remote sensing data, based on assumptions about habitat suitability, should 
always be interpreted with care. Data from the National Statistics Institute (Instituto 
Nacional de Estatistica (INE) 2019) reported cattle densities in the study area to be 18.76 
individuals/km?, ranging from 9 individuals/km? in the Lamas de Mouro and Castro 
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Laboreiro area to 37 individuals/km? in the Gaveira Region (Instituto Nacional de 
Estatistica (INE) 2019). Our estimates (approx. 23 individuals/km7?) lie within this interval. 
For horses, we also obtained data from the INE. Yet, these data seem to be 
underestimated as it only reports a density of 0.76 individuals/km? and about 115 horses 
for the region, whereas we obtained densities of almost 11 individuals/km?. 


For future analysis, we aim at analysing temporal trends, not only in occupancy, but also in 
abundance and investigate the occupancy-abundance-relationship in more depth. With the 
data from just one year, it was not possible to observe a relationship between those two 
metrics. Potential reasons are that we only have one estimate per species for the entire 
study area and the relationship might depend on species specific variables, such as home 
range or site specific variables, such as habitat (Efford and Dawson 2012). Another reason 
is that there can be time lags in the response of occupancy to increases and decreases in 
abundance that only become apparent from long-term monitoring initiatives (Gaston et al. 
2000). Yet, if only considering the wild species, we do observe that higher populations 
densities could be correlated with higher occupancy estimates. While the domestic species 
seem to appear in larger groups only in certain parts of the study area, leading to high 
density estimates with lower occupancy estimates, wild species seem to have higher 
densities with increasing occupancy. We expect that new technological advancements will 
soon make the application of CTDS to previously-collected data possible, so that we are 
planning to also obtain density estimates for the years of 2015 - 2018 and from 2020 
onwards. This should hopefully give us further insights into the relationship of occupancy 
and abundance. Additionally, to including data from several years, we plan to also include 
environmental variables, as well as potential interactions between wild and domestic 
species in the analysis, to gain a better understanding of the relationship on a finer spatial 
scale. 
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Brief description: All occupancy and detection probability estimates for each species per year 
including standard error (SE). 
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Brief description: Observed distances of the six most common mammal species used for the 
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Brief description: Camera trap deployments in Camtrap DP format. 

deploymentID - Unique identifier of the deployment. 

locationID - Unique identifier of the deployment location. 

locationName - Name given to the deployment location. 

longitude - Longitude of the deployment location in decimal degrees, using the WGS84 datum. 
latitude - Latitude of the deployment location in decimal degrees, using the WGS84 datum. 

start - Date and time at which the deployment was started. Formatted as an ISO 8601 string with 
timezone designator. 

end - Date and time at which the deployment was ended. Formatted as an ISO 8601 string with 
timezone designator. 
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Brief description: Observations in Camtrap DP format. 
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deploymentID - Unique identifier of the deployment the observation belongs to. 

sequencelD - Unique identifier of the sequence (collection of media files grouped by a predefined 
‘package.project.sequencelnterval’) that is the source of the observation. 

timestamp - Date and time of the observation. Formatted as an ISO 8601 string with timezone 
designator (YYYY-MM-DDThh:mm:ssZ or “YYYY-MM-DDThh:mm:ssthh:mm*). For file-based 
observations, this is the ‘media.timestamp’ of the associated media file (in “medialD’), for 
sequence-based observations the ‘media.timestamp’ of the first media file in the associated 
sequence (in ‘sequencelD’). 

observationType - Type of the observation.cameraSetup true’ if the observation is part of the 
camera setup process (camera deployment, pickup, maintenance). 

taxonlD - Unique identifier of the ‘scientificName’ as defined in ‘package.taxonomic.taxonID’ for 
that scientific name.scientificNameScientific name of the observed individual(s). 
classificationMethod - Classification method. 

classifiedBy - Name or unique identifier of the person or Al algorithm that classified the 
observation. 

classificationTimestamp - Date and time of the classification. Formatted as an ISO 8601 string 
with timezone designator. 

classificationConfidence - Accuracy confidence of the classification. Expressed as a probability, 
with *1° being maximum confidence. 

Download file (15.01 MB) 
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Data type: R Code 

Brief description: R Code for the occupancy analysis presented in this publication. 
Download file (28.98 kb) 


